Abstract. We prove that in a random tournament the events $\{s \to a\}$
and $\{t \to b\}$ are positively correlated, for distinct vertices $a, s, b, t \in K_n$.
It is also proven that the correlation between the events $\{s \to a\}$
and $\{t \to b\}$ in the random graphs $G(n,p)$ and $G(n,m)$ with random
orientation is positive for every fixed $p > 0$ and sufficiently large $n$ (with
$m = \lfloor p\binom{n}{2} \rfloor$). We conjecture it to be positive for all $p$ and all $n$. An
exact recursion for $P(\{s \to a\} \cap \{t \to b\})$ in $G(n,p)$ is given.



1. Introduction 

Let $G$ be a graph on $n$ vertices and $a, b, s, t \in V(G)$ four different vertices
in the graph. Let further every edge in $G$ be oriented either way with
the same probability, independently of the other edges. This model was first
considered in [1], and a similar model was discussed in [3]. We will study
the correlation between the two events $\{s \to a\}$ and $\{t \to b\}$. Our main
result is that these events are positively correlated for the complete graph
and for two natural models of random graphs. Note however that it is easy
to construct examples where the correlation is negative, e.g. if $G$ is the
path on four vertices with edges $sb$, $ba$, $at$.
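The path example can be checked by brute force. The following is a minimal sketch, not taken from the paper: it enumerates all $2^3$ orientations of the path with edges $sb$, $ba$, $at$ and computes the covariance of the indicators of $\{s \to a\}$ and $\{t \to b\}$ exactly. The helper names `reaches` and `covariance` are hypothetical.

```python
from fractions import Fraction
from itertools import product

def reaches(arcs, src, dst):
    """Return True if dst can be reached from src along the directed arcs."""
    seen, stack = {src}, [src]
    while stack:
        u = stack.pop()
        for x, y in arcs:
            if x == u and y not in seen:
                seen.add(y)
                stack.append(y)
    return dst in seen

def covariance(edges, s, a, t, b):
    """Exact Cov(1{s->a}, 1{t->b}) when every edge is oriented uniformly at random."""
    total = 2 ** len(edges)
    n_sa = n_tb = n_both = 0
    for flips in product((False, True), repeat=len(edges)):
        # flip=True reverses the listed direction of the edge
        arcs = [(v, u) if f else (u, v) for (u, v), f in zip(edges, flips)]
        sa, tb = reaches(arcs, s, a), reaches(arcs, t, b)
        n_sa += sa
        n_tb += tb
        n_both += sa and tb
    return Fraction(n_both, total) - Fraction(n_sa, total) * Fraction(n_tb, total)

path_edges = [("s", "b"), ("b", "a"), ("a", "t")]
cov_path = covariance(path_edges, "s", "a", "t", "b")
print(cov_path)  # -1/16
```

Here $s \to a$ needs the middle edge oriented from $b$ to $a$, while $t \to b$ needs it oriented from $a$ to $b$, so the two events are disjoint and the covariance is $-\tfrac14\cdot\tfrac14$.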

The events $\{s \to a\}$ and $\{s \to b\}$ can be shown to have positive correlation
for any vertices in any graph $G$. In [1] it was proven, somewhat surprisingly,
that the events $\{s \to a\}$ and $\{b \to s\}$ also have positive correlation in $K_n$
when $n \ge 5$, but negative correlation if $G$ is a tree or a cycle. Further, in [2]
it was shown that in the random graph models $G(n,p)$ and $G(n,m)$, for a
fixed probability $p$ ($= m/\binom{n}{2}$) and large enough $n$, the correlation between
$\{s \to a\}$ and $\{b \to s\}$ is negative if $p$ is below a critical value and positive
if $p$ is above the critical value. The critical value in $G(n,p)$ was exactly $1/2$
and in $G(n,m)$ approximately $0.799$.

The situation in this paper turns out to be different. We prove positive
correlation when $G$ is $K_n$ and in $G(n,p)$ and $G(n,m)$ for fixed $p > 0$ and $n$
sufficiently large. We conjecture that it is in fact non-negative for all pairs
$n, p$.

For technical reasons we will study the complementary events $A := \{s \nrightarrow a\}$,
the event that no directed path from $s$ to $a$ exists, and $B := \{t \nrightarrow b\}$.
Note that the events $A$ and $B$ have the same covariance as the events $\{s \to a\}$
and $\{t \to b\}$.
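The claim about the covariance of the complementary events is a general fact about indicator variables. As a short worked check, writing $A^c = \{s \to a\}$ and $B^c = \{t \to b\}$ (notation introduced here only for this computation):

```latex
\begin{align*}
P(A \cap B) - P(A)P(B)
  &= \bigl(1 - P(A^c) - P(B^c) + P(A^c \cap B^c)\bigr)
     - \bigl(1 - P(A^c)\bigr)\bigl(1 - P(B^c)\bigr) \\
  &= P(A^c \cap B^c) - P(A^c)P(B^c).
\end{align*}
```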

The paper is organized as follows. In Section 2 we present a lower bound
for $P(A \cap B)$ in a random tournament and prove that $A$ and $B$ are positively
correlated for $n \ge 4$. An intuitive explanation is that the biggest terms of $P(A)$ come
from the cases when no edges are directed from $s$ and when no edges are directed to
$a$, and analogously for $P(B)$. We also show that the relative covariance of the
two events converges to $2/3$ as $n \to \infty$.

In Section 3 we consider the random graph $G(n,p)$ on $n$ vertices. It
is a random graph model in which every edge exists with probability $p$
independently of the others, and then every existing edge is directed in either
of the two directions with the same probability, independently of all other
edges. Note that the two random processes can be combined in two different
ways. In this paper we study the joint probability space of $G(n,p)$ and
that of edge orientations, which we call $\vec{G}(n,p)$. This will be referred to as
the annealed version. The other possibility, the quenched version, will be
briefly discussed in Section 6. We prove that for fixed $p > 0$ and sufficiently
large $n$ the events $A$ and $B$ will be positively correlated in $\vec{G}(n,p)$.

In Section 4 we study the random graph model $G(n,m)$, with uniform
distribution among all graphs with $n$ vertices and $m$ edges. Note that in this
graph the edges do not exist independently of each other, since the number
of edges in the graph is fixed. As before, every existing edge is directed in
either way with equal probability, independently of all other edges. We prove
that for fixed $p = m/\binom{n}{2}$ the events $A$ and $B$ are positively correlated for
sufficiently large $n$.

In Section 5 we give an exact recursion to compute $P(A \cap B)$ in $\vec{G}(n,p)$,
which supports our conjecture that the correlation is positive for all values
of $n$ and $p$.

The problems studied here were first motivated by the, so far unsuccessful,
attempts to prove the so-called bunkbed conjecture, see [5].

2. Correlation in a random tournament 

To show that the correlation between $A$ and $B$ is positive we need a
sufficient upper bound for $P(A)$ (and $P(B)$) and a lower bound for $P(A \cap B)$.
Both an upper bound and a lower bound for $P(A)$ were given in [1]:

Lemma 1 (Theorem 2.1 in [1]). For all $n \ge 2$,

The next lemma gives a lower bound for the probability of the event $A \cap B$.
Lemma 2. For all $n \ge 4$,

Proof. Define $I_{a,b}$ to be the set of points in $[n]\setminus\{a,b\}$ that can reach $a$ or $b$ in
one step, that is, with a single edge directed to $a$ or $b$. Similarly define $O_{s,t}$
to be the set of points in $[n]\setminus\{s,t\}$ that can be reached from $s$ or $t$ in one
step. Define further $I_a$ and $I_b$ to be the sets of points in $[n]\setminus\{a\}$ and $[n]\setminus\{b\}$,
respectively, that can reach $a$ and $b$, respectively, in one step, and finally in
the same way define $O_s$ and $O_t$ to be the sets of points in $[n]\setminus\{s\}$ and $[n]\setminus\{t\}$,
respectively, that can be reached from $s$ and $t$, respectively, in one step.
The four events $I_{a,b} = \emptyset$, $O_{s,t} = \emptyset$, $I_a = O_t = \emptyset$ and $I_b = O_s = \emptyset$ all imply
$A \cap B$. Hence we have

\[ P(A \cap B) \ge P\bigl((I_{a,b} = \emptyset) \cup (O_{s,t} = \emptyset) \cup (I_a = O_t = \emptyset) \cup (I_b = O_s = \emptyset)\bigr). \]

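By inclusion-exclusion we have

\[
P\bigl((I_{a,b} = \emptyset) \cup (O_{s,t} = \emptyset) \cup (I_a = O_t = \emptyset) \cup (I_b = O_s = \emptyset)\bigr)
= 2\left(\tfrac12\right)^{2(n-2)} + 2\left(\tfrac12\right)^{2n-3}
- \left(\tfrac12\right)^{4n-12} - 4\left(\tfrac12\right)^{3n-6} + 2\left(\tfrac12\right)^{4n-10},
\]

since the events $(I_a = \emptyset)$ and $(I_b = \emptyset)$ are disjoint and so are the events
$(O_s = \emptyset)$ and $(O_t = \emptyset)$. □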
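The inclusion-exclusion step can be sanity-checked by exhaustive enumeration for small $n$. The sketch below is an illustration only: the vertex labelling, the helper names and the closed form $3\left(\tfrac12\right)^{2n-4} - \left(\tfrac12\right)^{3n-8} - \left(\tfrac12\right)^{4n-11}$ (our own simplification of the inclusion-exclusion sum over the four events, not a formula quoted from the source) are assumptions.

```python
from fractions import Fraction
from itertools import combinations, product

def reach(out, src):
    """Set of vertices reachable from src; out[u] holds the heads of arcs leaving u."""
    seen, stack = {src}, [src]
    while stack:
        u = stack.pop()
        for v in out[u] - seen:
            seen.add(v)
            stack.append(v)
    return seen

def tournament_stats(n):
    """Enumerate all tournaments on n >= 4 vertices and return exact values of
    P(A), P(A and B) and P(union of the four sufficient events)."""
    s, a, t, b = 0, 1, 2, 3  # hypothetical labelling of the four special vertices
    pairs = list(combinations(range(n), 2))
    total = 2 ** len(pairs)
    cnt_a = cnt_ab = cnt_union = 0
    for bits in product((0, 1), repeat=len(pairs)):
        out = [set() for _ in range(n)]
        inn = [set() for _ in range(n)]
        for (u, v), bit in zip(pairs, bits):
            if bit:
                u, v = v, u
            out[u].add(v)
            inn[v].add(u)
        ev_a = a not in reach(out, s)   # A: no directed path from s to a
        ev_b = b not in reach(out, t)   # B: no directed path from t to b
        e1 = inn[a] <= {b} and inn[b] <= {a}   # I_{a,b} is empty
        e2 = out[s] <= {t} and out[t] <= {s}   # O_{s,t} is empty
        e3 = not inn[a] and not out[t]         # I_a = O_t = empty
        e4 = not inn[b] and not out[s]         # I_b = O_s = empty
        cnt_a += ev_a
        cnt_ab += ev_a and ev_b
        cnt_union += e1 or e2 or e3 or e4
    return Fraction(cnt_a, total), Fraction(cnt_ab, total), Fraction(cnt_union, total)

def closed_form(n):
    """Assumed simplification of the inclusion-exclusion sum."""
    h = Fraction(1, 2)
    return 3 * h ** (2 * n - 4) - h ** (3 * n - 8) - h ** (4 * n - 11)

stats = {n: tournament_stats(n) for n in (4, 5)}
for n, (p_a, p_ab, p_union) in stats.items():
    print(n, p_a, p_ab, p_union, p_ab - p_a ** 2)
```

For $n = 4$ and $n = 5$ the enumerated union probability agrees with the closed form, and the resulting lower bound on $P(A \cap B)$ already exceeds $P(A)^2$.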

Theorem 1. The events $A = \{s \nrightarrow a\}$ and $B = \{t \nrightarrow b\}$ are positively
correlated for $n \ge 4$.

Proof. From Lemmas 1 and 2 we get

$P(A \cap B) - P(A)P(B) = P(A \cap B) - (P(A))^2 >$

n—A / 




1 + 3.2 




The cases $4 \le n \le 12$ were checked using Lemma 2 and the values of $P(A)$
computed by recursion in [1]. The (rounded) values used are listed below.

 n     P(A)
 4     0.25
 5     0.146484
 6     0.076416
 7     0.036942
 8     0.017427
 9     0.008309
10     0.004038
11     0.001988
12     0.000986

□ 

We can also give an upper bound for $P(A \cap B)$ to show that
$\lim_{n\to\infty} P(A \cap B) \cdot 2^{2n-4} = 3$ and
$\lim_{n\to\infty} \frac{P(A \cap B) - P(A)P(B)}{P(A \cap B)} = \frac{2}{3}$. These statements are
special cases of Theorems 2 and 3 below.
3. Random orientations of G(n,p).



Let as usual $G(n,p)$ be the random graph in which every edge exists
with probability $p$ independently of the other edges. We also let every
edge be directed in either way with equal probability, independently of the
others. We will call the corresponding random graph model $\vec{G}(n,p)$. For this
section, let $x = p/2$ be the probability that a given edge exists and is directed
in a certain way, and let $y = 1 - x$ be the probability that an edge does not exist
in a certain direction. We will adopt the usual notation $f \sim g$ to denote
that the quotient of $f$ and $g$ goes to a constant. In [2] the following lemma
was proven.

Lemma 3 (Lemma 4.2 in [2]). For any vertices $s, a$ in $\vec{G}(n,p)$,

\[ P(A) \sim 2y^{n-1}. \]

Clearly, $P(A) = P(B)$. To find the relative correlation between $A$ and $B$
when $n$ approaches infinity we need an estimate of $P(A \cap B)$.

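A set $X$ of vertices in $K_n$ is said to be an inset (outset) if all existing
edges from $[n]\setminus X$ are directed to (from) $X$. Let $I_X$ be the event that $X$ is
an inset. Let also

\[
Z_k = \bigcup_{\substack{X :\, s \in X \\ a \notin X \\ |X| = k}} I_X
\qquad\text{and}\qquad
Z'_k = \bigcup_{\substack{X' :\, t \in X' \\ b \notin X' \\ |X'| = k}} I_{X'}.
\]

Now we have

\[ P(s \nrightarrow a) = P\Bigl(\bigcup_{k=1}^{n-1} Z_k\Bigr) \]

and

\[ P(A \cap B) = P(s \nrightarrow a,\ t \nrightarrow b) = P\Bigl(\bigcup_{k=1}^{n-1} Z_k \cap \bigcup_{k=1}^{n-1} Z'_k\Bigr). \]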

Theorem 2. For $p \in (0, 1]$ we have

\[ \lim_{n\to\infty} \frac{P(A \cap B)}{y^{2n-4}} = 4 - p. \]

Remark 1. Exact computations indicate that this convergence is very slow
for small $p$, see Figure 2 in Section 5.

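Proof. First note that
$P(A \cap B) = P\bigl(\bigcup_{k=1}^{n-1} Z_k \cap \bigcup_{k=1}^{n-1} Z'_k\bigr) = s_1 + s_2 + s_3 - s_4$, where

\[
\begin{aligned}
s_1 &= P\Bigl(\bigcup_{k=3}^{n-3} Z_k \cap \bigcup_{k=1}^{n-1} Z'_k\Bigr), \qquad
s_2 = P\Bigl(\bigcup_{k=1}^{n-1} Z_k \cap \bigcup_{k=3}^{n-3} Z'_k\Bigr), \\
s_3 &= P\Bigl(\Bigl(\bigcup_{k=1}^{2} Z_k \cup \bigcup_{k=n-2}^{n-1} Z_k\Bigr) \cap \Bigl(\bigcup_{k=1}^{2} Z'_k \cup \bigcup_{k=n-2}^{n-1} Z'_k\Bigr)\Bigr), \qquad
s_4 = P\Bigl(\bigcup_{k=3}^{n-3} Z_k \cap \bigcup_{k=3}^{n-3} Z'_k\Bigr).
\end{aligned}
\]

(Recall that $Z'_k$ is the union of the events $I_{X'}$ over sets $X'$ with $t \in X'$, $b \notin X'$ and $|X'| = k$.)
By symmetry $s_1 = s_2$ and clearly $s_4 \le s_1$. We will write $P_N(I_X)$ for $P(I_X)$
with $|X| = N$. We show that $s_1, s_2, s_4$ are negligible compared to $s_3$, and
give an estimate of $s_3$. Starting with $s_1$, first note that $P_k(I_X) = y^{k(n-k)}$,
and if $k < l \le \frac{n}{2}$ we have $P_l(I_Y) < P_k(I_X)$. This gives us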

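\[
\begin{aligned}
s_1 &= P\Bigl(\bigcup_{k=3}^{n-3} Z_k \cap \bigcup_{k=1}^{n-1} Z'_k\Bigr)
\le P\Bigl(\bigcup_{k=3}^{n-3} Z_k\Bigr)
\le \sum_{k=3}^{n-3} \binom{n-2}{k-1} y^{k(n-k)} \\
&\le 2\sum_{k=3}^{K-1} \binom{n-2}{k-1} y^{k(n-k)} + \sum_{k=K}^{n-K} \binom{n-2}{k-1} y^{k(n-k)}.
\end{aligned}
\]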

Now, since p is fixed we may fix K such that < The sum 

J^/E^ 1 {}^Li)y k ^ n ~ k ^ is finite and it is 0(y^ n ~^) which is very small com- 
pared to y 2n , and hence negligable. Further we get 

n-K 

\ y k(n-k) < 2 n. y K(n-K) 



E 

k=K 



n 
k-l 



3 \ n—K 

< 2"(y) =0(y 3 ")- 



That is, $s_1 = o(y^{2n})$, and analogously so are $s_2$ and $s_4$.

To estimate $s_3$, first consider $P(Z_1 \cap Z'_2)$ as an example. In this case no
edges will be directed from $s$. For the inset $X'$ we have two subcases: either
it contains $s$ and $t$, or $t$ and another vertex (different from $s, b$). In the first
case we get a total of $y^{2n-3}$, and for the second case we can choose $X'$ in $n - 3$
ways and no edges will be directed from $X'$; this gives us $(n-3)y^{3n-9}(1-p)^2$.
In the computations below it will always be the case that if three or more
vertices are involved, then the probability is negligible, i.e. $o(y^{2n})$.

We get four contributing cases which can be reduced to two by symmetry.

(1) $P((Z_1 \cup Z_2) \cap (Z'_1 \cup Z'_2)) = y^{2n-4} + o(y^{2n})$.

(2) $P((Z_{n-1} \cup Z_{n-2}) \cap (Z'_{n-1} \cup Z'_{n-2})) = y^{2n-4} + o(y^{2n})$.

(3) $P(Z_1 \cap Z'_{n-1}) = y^{2n-3}$.

(4) $P(Z_{n-1} \cap Z'_1) = y^{2n-3}$.

For (1) we see that if any vertex other than $s$ and $t$ is in the insets for $s$
and $t$ we will have conditions on at least $3n - 9$ edges and thus a probability
of size $o(y^{2n})$. All the interesting cases are when we have no restriction on
the possible edge between $s$ and $t$, and no edge may be directed from $s, t$
to any other vertex. Note that our example above is a subset of this case.
Case (2) is symmetric to (1).

For (3) no edge may be directed from $s$ or to $b$, which imposes conditions
on $2n - 3$ edges. Case (4) is symmetric to (3). One can easily check that the
remaining six possibilities, four cases symmetric to $Z_1 \cap Z'_{n-2}$ and two cases
symmetric to $Z_2 \cap Z'_{n-2}$, all have probabilities of size $o(y^{2n})$ and can hence be
ignored.



All together we end up with
$2y^{2n-4} + 2y^{2n-3} + o(y^{2n}) = 2y^{2n-4}\bigl(1 + (1 - \tfrac{p}{2})\bigr) + o(y^{2n}) = y^{2n-4}(4 - p) + o(y^{2n})$. □

Theorem 3. For fixed $p \in [0, 1]$,

\[ \lim_{n\to\infty} \frac{P(A \cap B) - P(A)P(B)}{P(A \cap B)} = \frac{p(3 - p)}{4 - p}. \]

Proof. Follows from Lemma 3 and Theorem 2. □
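The step from Lemma 3 and Theorem 2 to this limit can be written out as follows. This is a short sketch that uses only $P(A) = P(B) \sim 2y^{n-1}$, $P(A \cap B) \sim (4-p)y^{2n-4}$ and $y = 1 - p/2$, and it assumes $p \in (0,1]$ so that both asymptotic statements apply.

```latex
\begin{align*}
\frac{P(A)P(B)}{P(A \cap B)}
  &= \frac{\bigl(2y^{n-1}\bigr)^2 (1 + o(1))}{(4-p)\,y^{2n-4}\,(1 + o(1))}
  \longrightarrow \frac{4y^2}{4-p} = \frac{(2-p)^2}{4-p}, \\
\frac{P(A \cap B) - P(A)P(B)}{P(A \cap B)}
  &\longrightarrow 1 - \frac{(2-p)^2}{4-p}
  = \frac{(4-p) - (4 - 4p + p^2)}{4-p}
  = \frac{p(3-p)}{4-p}.
\end{align*}
```

For $p = 1$ the limit equals $2/3$, in agreement with the value stated for the random tournament.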

Corollary 4. For a fixed $p \in (0,1]$, the correlation between $A$ and $B$ is
always positive for sufficiently large $n$.

We believe that something stronger is true and we offer the following
conjecture, which is supported by our calculations in Section 5.

Conjecture 1. For any $n \ge 4$ and $p \in (0,1]$, the events $\{s \to a\}$ and
$\{t \to b\}$ are always positively correlated.
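For the smallest case $n = 4$ the conjecture can be checked directly in the annealed model by enumerating the $3^6$ states of the six vertex pairs (absent, or present in one of two directions). The following is a minimal sketch under that setup; the function names and the vertex labelling are hypothetical, and it is not the recursion used in the paper.

```python
from fractions import Fraction
from itertools import combinations, product

def reach(out, src):
    """Set of vertices reachable from src; out[u] holds the heads of arcs leaving u."""
    seen, stack = {src}, [src]
    while stack:
        u = stack.pop()
        for v in out[u] - seen:
            seen.add(v)
            stack.append(v)
    return seen

def annealed_stats(n, p):
    """Exact (P(A), P(B), P(A and B)) in the annealed model with edge probability p."""
    s, a, t, b = 0, 1, 2, 3  # hypothetical labelling
    pairs = list(combinations(range(n), 2))
    # state 0: pair absent, state 1: arc u->v, state 2: arc v->u
    weight = (1 - p, p / 2, p / 2)
    pa = pb = pab = Fraction(0)
    for states in product((0, 1, 2), repeat=len(pairs)):
        w = Fraction(1)
        out = [set() for _ in range(n)]
        for (u, v), st in zip(pairs, states):
            w *= weight[st]
            if st == 1:
                out[u].add(v)
            elif st == 2:
                out[v].add(u)
        if w == 0:
            continue
        ev_a = a not in reach(out, s)
        ev_b = b not in reach(out, t)
        pa += w * ev_a
        pb += w * ev_b
        pab += w * (ev_a and ev_b)
    return pa, pb, pab

pa_half, pb_half, pab_half = annealed_stats(4, Fraction(1, 2))
cov_half = pab_half - pa_half * pb_half
print(pa_half, cov_half)
```

Setting `p = Fraction(1)` recovers the random tournament on four vertices, which gives a quick consistency check against Section 2.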

4. Random orientations of G(n,m) 

In this section we study the same problem on the random graph $G(n,m)$,
where each simple graph with $m$ edges and $n$ vertices is equally likely. We
will also here let every edge have an independent direction and call the
combined probability space $\vec{G}(n,m)$. Again, let $y = 1 - \frac{p}{2}$, and let further
$q(l) = q(l; n, m)$ be the probability that $l$ fixed edges in $K_n$ do not exist in
$\vec{G}(n,m)$ with given directions. In $\vec{G}(n,p)$ this corresponds to $y^l$. If nothing
else is written, the graph considered in this section is always $\vec{G}(n,m)$.

In [2] the following lemma was proven.

Lemma 4 (Janson, Lemma 3.2 in [2]). Suppose that $0 < m = m(n) < \binom{n}{2}$.
Then with $p = p(n) = m(n)/\binom{n}{2}$, as $n \to \infty$,

\[ q(l; n, m) \sim y^l \exp\left(-\left(\frac{l}{n}\right)^2 \frac{p(1-p)}{(2-p)^2}\right), \]

and for any $l, n, m$ we have $q(l; n, m) < q'(l; n, p)$.

This lemma together with the proof of Theorem 2 gives us an analogue
of Theorem 2 for $\vec{G}(n,m)$.

Theorem 5. In the case of $\vec{G}(n,m)$, for fixed $0 < p < 1$ we have

\[ P(A \cap B) \sim 2y^{2n-4} \exp\left(-4\,\frac{p(1-p)}{(2-p)^2}\right)\left(2 - \frac{p}{2}\right). \]

Also we need the following lemma.
Lemma 5 (Lemma 4.3 in [2]). For fixed $0 < p < 1$,

\[ P(A) \sim 2y^{n-1} \exp\left(-\frac{p(1-p)}{(2-p)^2}\right). \]



We are now ready to state and prove the main theorem of this section.
Theorem 6. For fixed $0 < p < 1$ and sufficiently large $n$, the events $A$ and
$B$ are positively correlated and the relative covariance is asymptotically

\[ 1 - \frac{(2-p)^2}{4-p} \exp\left(\frac{2p(1-p)}{(2-p)^2}\right). \]

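Proof. We rewrite the relative covariance as

\[ \frac{P(A \cap B) - P(A)P(B)}{P(A \cap B)} = 1 - \frac{P(A)P(B)}{P(A \cap B)}. \]

As $n$ approaches $\infty$, Theorem 5 and Lemma 5 give

\[
\frac{P(A)P(B)}{P(A \cap B)} \longrightarrow
\frac{4y^{2n-2} \exp\left(-2\,\frac{p(1-p)}{(2-p)^2}\right)}{2y^{2n-4} \exp\left(-4\,\frac{p(1-p)}{(2-p)^2}\right)\left(2 - \frac{p}{2}\right)}
= \frac{(2-p)^2}{4-p} \exp\left(\frac{2p(1-p)}{(2-p)^2}\right).
\]

Let us denote this expression by $f$. It remains to prove that $f$ is less than
one when $0 < p < 1$. This can be proven by using the derivative of $f$. We
have that

\[ f'(p) = e^{\frac{2(1-p)p}{(2-p)^2}} \cdot \frac{p^3 - 4p^2 - 8}{(4-p)^2(2-p)}. \]

The theorem follows since the derivative is negative in this interval and
$f(0) = 1$.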

□ 
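The monotonicity argument can be checked numerically. The sketch below takes the limit from the proof to be $f(p) = \frac{(2-p)^2}{4-p}\exp\bigl(\frac{2p(1-p)}{(2-p)^2}\bigr)$ and its derivative to be $f'(p) = e^{2p(1-p)/(2-p)^2}\,\frac{p^3 - 4p^2 - 8}{(4-p)^2(2-p)}$; both expressions are assumptions obtained by combining Theorem 5 and Lemma 5 and differentiating, since the extracted source is hard to read at this point. The function names are hypothetical.

```python
import math

def f(p):
    """Assumed limit of P(A)P(B) / P(A and B) in the G(n,m) model."""
    return (2 - p) ** 2 / (4 - p) * math.exp(2 * p * (1 - p) / (2 - p) ** 2)

def f_prime(p):
    """Assumed closed form of the derivative of f."""
    return (math.exp(2 * (1 - p) * p / (2 - p) ** 2)
            * (p ** 3 - 4 * p ** 2 - 8) / ((4 - p) ** 2 * (2 - p)))

def central_difference(g, p, h=1e-6):
    """Numerical derivative used to cross-check the closed form."""
    return (g(p + h) - g(p - h)) / (2 * h)

grid = [i / 100 for i in range(1, 100)]
max_derivative_error = max(abs(f_prime(p) - central_difference(f, p)) for p in grid)
print(f(0.0), f(1.0), max_derivative_error)
```

On this grid $f$ stays below $1$ and the closed-form derivative is negative, so the relative covariance $1 - f(p)$ is positive for $0 < p < 1$; at $p = 1$ it equals $2/3$, matching the tournament case.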

We conjecture the covariance to be positive at all times. 

Conjecture 2. The events $A$ and $B$ are positively correlated in $\vec{G}(n,m)$
for all $p$ and all $n$.

Note that the covariance of $\vec{G}(n,p)$ is always less than the covariance of
$\vec{G}(n,m)$ (see [2]). So the conjecture would also imply the correlation to be
positive in $\vec{G}(n,p)$.

5. Exact recursion in G(n,p). 
In this section we will give an exact recursion to compute

\[ h_n(p) := P_{\vec{G}(n,p)}(a \nleftarrow s,\ t \nrightarrow b). \]

Together with the recursion given for $f_n(p) := P_{\vec{G}(n,p)}(a \nleftarrow s)$ in [2] we
will be able to compute the covariance for $n$ as a rational function in $p$. Our
computations for $n \le 34$, using Maple, support our Conjecture 1 that the
covariance is always positive, see Figure 1.

Figure 1. The relative covariance
$\frac{P(a \nleftarrow s,\ t \nrightarrow b) - P(a \nleftarrow s)\,P(t \nrightarrow b)}{P(a \nleftarrow s,\ t \nrightarrow b)}$
in $\vec{G}(n,p)$ for, going from right to left, $n = 6$ (green), 8, 10, 12, 14, 16, 18, 20, 22 (blue),
and the asymptote $p(3 - p)/(4 - p)$. All curves are positive for $0 < p < 1$.

Figure 2. Plots of $P(A \cap B)/y^{2n-4}$ in $\vec{G}(n,p)$ for, going from right
to left, $n = 6$ (green), 8, 10, 12, 14, 16, 18, 20, 22 (blue), and
the asymptote $4 - p$. See Theorem 2.

For a vertex $v \in V(G)$, let $\overrightarrow{C}_v \subseteq V(G)$ be the (random) set of all vertices
$u$ for which there is a directed path from $v$ to $u$. We say that $\overrightarrow{C}_v$ is the
out-cluster from $v$. Let analogously the in-cluster, $\overleftarrow{C}_v \subseteq V(G)$, be the
(random) set of all vertices $u$ for which there is a directed path from $u$ to
$v$. Note that we will use the convention that $v \in \overrightarrow{C}_v \cap \overleftarrow{C}_v$. Let as before
$y := 1 - p/2$ be the probability that an edge does not exist with a certain
direction, and let $q := 1 - p$ be the probability that there is no edge at all.
For $n \ge 1$, $s \in S \subseteq [n]$ and $|S| = k$ define:

\[ d_p(n,k) := P_{\vec{G}(n,p)}(\overrightarrow{C}_s = S), \]

where in particular $d_p(1,1) = 1$. A recursion to compute $d_p(n,k)$ as a
polynomial in $p$ was given in [2].

Lemma 6 (Lemma 5.1 in [2]). We have the following recursions:

\[ d_p(n,k) = d_p(k,k)\, y^{k(n-k)}, \quad \text{for } n \ge k \ge 1, \]

and

\[ d_p(k,k) = 1 - \sum_{i=1}^{k-1} \binom{k-1}{i-1} d_p(i,i)\, y^{i(k-i)}. \]

Note that, by symmetry, also $P_{\vec{G}(n,p)}(\overleftarrow{C}_s = S) = d_p(n,k)$.
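The recursion for $d_p(n,k)$ is easy to run with exact rational arithmetic. The sketch below is an illustration under two assumptions: the recursion is read as $d_p(k,k) = 1 - \sum_{i=1}^{k-1}\binom{k-1}{i-1} d_p(i,i)\,y^{i(k-i)}$ with $d_p(n,k) = d_p(k,k)\,y^{k(n-k)}$, and $P(s \nrightarrow a)$ is obtained by summing $d_p(n,k)$ over the $\binom{n-2}{k-1}$ possible out-clusters of size $k$ that contain $s$ but not $a$ (this summation is our own, not necessarily the recursion for $f_n(p)$ in [2]). The names `make_model`, `d_full`, `d` and `no_path` are hypothetical.

```python
from fractions import Fraction
from functools import lru_cache
from math import comb

def make_model(p):
    """Return (d, no_path) for edge probability p, using exact fractions."""
    y = 1 - Fraction(p) / 2  # probability that a pair carries no arc in a given direction

    @lru_cache(maxsize=None)
    def d_full(k):
        """d_p(k, k): the out-cluster of s is the whole vertex set of size k."""
        return 1 - sum(comb(k - 1, i - 1) * d_full(i) * y ** (i * (k - i))
                       for i in range(1, k))

    def d(n, k):
        """d_p(n, k): the out-cluster of s equals a fixed set of size k."""
        return d_full(k) * y ** (k * (n - k))

    def no_path(n):
        """P(s cannot reach a), summing over out-clusters that avoid a."""
        return sum(comb(n - 2, k - 1) * d(n, k) for k in range(1, n))

    return d, no_path

d_tour, no_path_tour = make_model(1)  # p = 1 is the random tournament
for n in range(4, 13):
    print(n, round(float(no_path_tour(n)), 6))
```

With $p = 1$ the first rows reproduce the rounded values $0.25$, $0.146484$, $0.076416$ listed for the random tournament in Section 2.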

It turns out that the following quantity is possible to compute recursively
and enables us to compute $h_n(p)$. For $n \ge 2$, $t \in T \subseteq [n]$, $a \in A \subseteq [n]$ with
$|T| = \tau$, $|A| = \alpha$ and $|[n] \setminus (A \cup T)| = r$ define:

\[ N_p(n, \tau, \alpha, r) := P_{\vec{G}(n,p)}(\overrightarrow{C}_t = T,\ \overleftarrow{C}_a = A), \]

where in particular $N_p(2, 2, 2, 0) = x$ and $N_p(2, 1, 1, 0) = y$.

We will use the variable $j$ for the size of the intersection $|A \cap T|$. If there
is any intersection between $A$ and $T$ then $a, t \in A \cap T$, so in particular
$j = \alpha + \tau - (n - r)$ can never be 1.

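Theorem 7. We have the following recursions for $N_p$, where $\tau + \alpha \ge n - r \ge \tau, \alpha$
and $\tau, \alpha \ge 1$:

(i) $N_p(n, \tau, \alpha, r) = N_p(n-r, \tau, \alpha, 0)\, q^{r(r+\tau+\alpha-n)}\, y^{r(2n-2r-\tau-\alpha)}$, if $r > 0$,

(ii) $N_p(n, \tau, \alpha, r) = N_p(n, \alpha, \tau, r)$,

(iii) $N_p(n, \tau, \alpha, 0) = \sum_{\zeta \ge 1} \binom{n-\tau-1}{\zeta-1} N_p(n-\zeta, \tau, \alpha-\zeta, 0)\, d_p(\zeta, \zeta)\, q^{(\zeta-1)(\alpha+\tau-n)}\, y^{(\zeta-1)(2n-\tau-\alpha-\zeta)} \bigl(y^{\tau} - y^{2n-\alpha-\tau-\zeta} q^{\alpha+\tau-n}\bigr)$, if $n > \tau$, $n \ge \alpha \ge 2$, $j \ge 2$,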

(iv) N p (n, t, a,0) = Y J [ ( _ x ) N p( n - C, r, a - £ 0)d p (C, C> 



( ^- |,| -+' 1 -0 !/ t ( l , /or a > 2,j = i.e. n = r + a, 



y 

(v) iV p (re, n,re, 0) 



re — 2\ sr-^ (n — t — 1\ 

A p (n,r,a,n - a - r), 

a — 1 / 



e : e 

r=l v y a=l 



Proof. For the first equation we have $r > 0$, thus $[n] \setminus (A \cup T)$ is non-empty
and no vertex in that set may have any edge directed to $A$ or from
$T$. Hence there must be no edge at all to $A \cap T$, which gives probability
$q^{|[n] \setminus (A \cup T)| \cdot |A \cap T|} = q^{r(r+\tau+\alpha-n)}$. There must not be any edge directed to
$(A \setminus T)$ and there must not be any edge directed from $(T \setminus A)$. This gives
the probability of $y^{|[n] \setminus (A \cup T)| \cdot |(A \setminus T) \cup (T \setminus A)|} = y^{r(2n-2r-\tau-\alpha)}$.

The second equation is obtained from the symmetry of reversing all di- 
rections and switching the roles of a and t. 
For equations (iii) and (iv), we pick a vertex $z \in A \setminus T$; such a vertex
exists by the assumption $n > \tau$ and $r = 0$. Let $G$ be any directed graph
on $n$ vertices with $\overrightarrow{C}_t = T$ and $\overleftarrow{C}_a = A$. If we remove vertex $z$ and all its
edges from $G$, the resulting graph will still have $\overrightarrow{C}_t = T$ since $z \notin T$, whereas
$\overleftarrow{C}_a = A \setminus Z$, for some $Z \subseteq A \setminus \{a\}$ such that $Z \cap T = \emptyset$. This follows from
the fact that the vertices in $Z$ are those that have a path to $a$ only via $z$,
and no vertex in $T$ has a directed path leading to $z$ by assumption. Let
$\zeta = |Z|$ and sum over all possible $Z$. The probability is $N_p(n - \zeta, \tau, \alpha - \zeta, 0)$
that the subgraph on $[n] \setminus Z$ is as needed. The subgraph on $Z$ must have
$\overleftarrow{C}_z = Z$, which has probability $d_p(\zeta, \zeta)$. Let us first consider equation (iii),
when $j = \tau + \alpha - n \ge 2$.

There must not be any edge between $T \cap A$ and $Z \setminus \{z\}$, since the vertices
of the latter do not belong to $T$ and have all directed paths via $z$. This gives
a factor $q^{(\zeta-1)(\alpha+\tau-n)}$. No vertex of $Z \setminus \{z\}$ can have an edge to $A \setminus (T \cup Z)$
or from $T \setminus A$, which gives a factor $y^{(\zeta-1)(2n-\tau-\alpha-\zeta)}$. Finally, we must
consider the edges of $z$. The main condition is that there must not be any
edge from $T$ to $z$. However, there must be at least one edge directed
from $z$ to $A \setminus Z$. This gives the last factor. The case of equation (iv), when
$j = 0$, is easier and obtained similarly.

Equation (v) follows from the fact that for fixed $n$

\[ \sum_{T, A \,:\, a \in A,\ t \in T \subseteq [n]} P_{\vec{G}(n,p)}(\overrightarrow{C}_t = T,\ \overleftarrow{C}_a = A) = 1. \]

Here $j = |A \cap T|$ and recall that $j = 1$ is not an option.

□ 



Theorem 8. We have the following expression for $P_{\vec{G}(n,p)}(a \nleftarrow s,\ t \nrightarrow b)$:



71 — 2 / ,\ 

w^^ 6 )=£ (": 2 • 

/ n - 2 f n -2- j\ n - T+j - 1 f n _ T _ A 
E( T _ j ) E ( a _ j )N p (n,T,a,n-a-T+j) 

\ T =j V 7 a=j 

" 1 fn-2- j\ n ~ T+j (n-r\ \ 
+ E ( r - ? _l) E ( .)N p (n,T,a,n-a-T + j)\ 
r=j+i \ J / a= j \ j/ / 

+ E("_i) E (^ali 2 ^)N p (n,T,a,n-a-T) 



a=l 
-2 / .\ n 



— ( 4\ — Z' l\ 



a 

V a — 1 / 

Proof. The equation for $P_{\vec{G}(n,p)}(a \nleftarrow s,\ t \nrightarrow b)$ is obtained by summing over
all possible pairs $A, T$ such that $s \notin A$, $b \notin T$. Again $j = |A \cap T|$ and the
formula is split into four cases depending on whether $s \in T$ or not and whether $j = 0$ or
not. □
Note that in $\vec{G}(n,p)$ the functions $P(s \nrightarrow a)$ and $P(s \nrightarrow a,\ t \nrightarrow b)$ are
polynomials in $p$ and hence continuous.

6. The Quenched model 

For the quenched version the correlation between $A$ and $B$ is computed for
each graph in $G(n,p)$ ($G(n,m)$) in the probability space of edge orientations,
and then the expected value is taken over all graphs.

We computed the covariance between $A$ and $B$ for $G(n,p)$ as a function
of $p$, in both the annealed and the quenched model, for $n \le 6$. The two
cases look quite similar, see Figure 3. Note that for $n \le 6$ the covariances
are positive also for small $p$, and we conjecture them to be positive for all $n$.
This differs from the behavior for the similar problem studied in Section 9
in [2]. It would also be interesting to find an analogue to Theorem 3 for the
quenched model.
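The difference between the two versions can be made concrete for $n = 4$. The sketch below is an illustration only, with hypothetical function names and vertex labelling: for each of the $2^6$ graphs it computes the conditional probabilities of $A$, $B$ and $A \cap B$ over all orientations, then forms the quenched value (the average of the per-graph covariances) and the annealed value (the covariance in the joint space).

```python
from fractions import Fraction
from itertools import combinations, product

def reach(out, src):
    """Set of vertices reachable from src; out[u] holds the heads of arcs leaving u."""
    seen, stack = {src}, [src]
    while stack:
        u = stack.pop()
        for v in out[u] - seen:
            seen.add(v)
            stack.append(v)
    return seen

def covariances(n, p):
    """Return (annealed, quenched) covariance of A and B for edge probability p."""
    s, a, t, b = 0, 1, 2, 3  # hypothetical labelling
    pairs = list(combinations(range(n), 2))
    e_a = e_b = e_ab = quenched = Fraction(0)
    for present in product((False, True), repeat=len(pairs)):
        edges = [pr for pr, keep in zip(pairs, present) if keep]
        w = p ** len(edges) * (1 - p) ** (len(pairs) - len(edges))  # P(graph)
        if w == 0:
            continue
        ca = cb = cab = 0
        for flips in product((False, True), repeat=len(edges)):
            out = [set() for _ in range(n)]
            for (u, v), fl in zip(edges, flips):
                if fl:
                    u, v = v, u
                out[u].add(v)
            ev_a = a not in reach(out, s)
            ev_b = b not in reach(out, t)
            ca += ev_a
            cb += ev_b
            cab += ev_a and ev_b
        m = 2 ** len(edges)
        pa, pb, pab = Fraction(ca, m), Fraction(cb, m), Fraction(cab, m)
        quenched += w * (pab - pa * pb)   # covariance given the graph, averaged
        e_a += w * pa
        e_b += w * pb
        e_ab += w * pab
    return e_ab - e_a * e_b, quenched

annealed_half, quenched_half = covariances(4, Fraction(1, 2))
print(annealed_half, quenched_half)
```

The two values differ by the covariance, over the random graph, of the conditional probabilities of $A$ and $B$; for $p = 1$ only the complete graph has positive weight and the two versions coincide.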
Figure 3. The covariance for $G(6,p)$. The dashed curve
represents the annealed case and the continuous one the
quenched case.